Automatic Music Tagging With Time Series Models
نویسندگان
چکیده
We present a system for automatic music annotation that leverages temporal (e.g., rhythmical) aspects as well as timbral content. Our system estimates a dynamic texture mixture (DTM) density over times series of acoustic features (instead of on individual features) for each tag in a semantic vocabulary. When analyzing a new song, our system processes the time series of acoustic features of the song and outputs a semantic multinomial, i.e., a vector of tag-affinities. A song is then annotated by selecting the top-ranking tags in its semantic multinomial. Tag-DTM models are estimated efficiently with the hierarchical EM algorithm for DTM (HEM-DTM) from all the DTMs modeling individual songs associated with a tag. E. Coviello, L. Barrington, A. Chan, and G. Lanckriet, “Automatic Music Tagging with Time Series Model”, ISMIR 2010, Utrecht (Netherlands). 9 13 Aug. 2010 E. Coviello, A. Chan, and G. Lanckriet, “Time Series Models for Semantic Music Annotation”, Transactions on Audio, Speech and Language Processing, 19-5, pp 1343 1359 . 1. MODELING AUDIO AND TAGS Our auto-tagging music information retrieval (MIR) system takes as input an audio track and computes the relevance of all the tags in a vocabulary to the audio track. The systems is based on the models in [3,4] previously applied to video and audio retrieval.
منابع مشابه
Automatic Tagging Using Deep Convolutional Neural Networks
We present a content-based automatic music tagging algorithm using fully convolutional neural networks (FCNs). We evaluate different architectures consisting of 2D convolutional layers and subsampling layers only. In the experiments, we measure the AUC-ROC scores of the architectures with different complexities and input types using the MagnaTagATune dataset, where a 4-layer architecture shows ...
متن کاملAutomatic Identification and Classification of the Iranian Traditional Music Scales (Dastgāh) and Melody Models (Gusheh): Analytical and Comparative Review on Conducted Research
Background and Aim: Automatic identification and classification of the Iranian traditional music scales (Dastgāh) and melody models (Gusheh) has attracted the attention of the researchers for more than a decade. The current research aims to review conducted researches on this area and consider its different approached and obstacles. Method: The research approach is content analysis and data col...
متن کاملسیستم برچسب گذاری اجزای واژگانی کلام در زبان فارسی
Abstract: Part-Of-Speech (POS) tagging is essential work for many models and methods in other areas in natural language processing such as machine translation, spell checker, text-to-speech, automatic speech recognition, etc. So far, high accurate POS taggers have been created in many languages. In this paper, we focus on POS tagging in the Persian language. Because of problems in Persian POS t...
متن کاملImproving Auto-tagging by Modeling Semantic Co-occurrences
Automatic taggers describe music in terms of a multinomial distribution over relevant semantic concepts. This paper presents a framework for improving automatic tagging of music content by modeling contextual relationships between these semantic concepts. The framework extends existing auto-tagging methods by adding a Dirichlet mixture to model the contextual co-occurrences between semantic mul...
متن کاملMultivariate Autoregressive Mixture Models for Music Auto-Tagging
We propose the multivariate autoregressive model for content based music auto-tagging. At the song level our approach leverages the multivariate autoregressive mixture (ARM) model, a generative time-series model for audio, which assumes each feature vector in an audio fragment is a linear function of previous feature vectors. To tackle tagmodel estimation, we propose an efficient hierarchical E...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010